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Abstract: Advanced nucleic acid-based technologies are powerflil research tools for novel 
virus discovery but need to be standardized for broader applications such as virus detection 
in biological products and clinical samples. We have used well-characterized retrovirus 
stocks to evaluate the limit of detection (LOD) for broad-range PCR with electrospray 
ionization mass spectrometry (PCR/ESI-MS or PLEX-ID), RT-PCR assays, and virus 
microarrays. The results indicated that in the absence of background cellular nucleic acids, 
PLEX-ID and RT-PCR had a similar LOD for xenotropic murine retrovirus-related virus 
(XMRV; 3.12 particles per \iL) whereas sensitivity of virus detection was 10-fold greater 
using virus microarrays. When virus was spiked into a background of cellular nucleic 
acids, the LOD using PLEX-ID remained the same, whereas virus detection by RT-PCR 
was 10-fold less sensitive, and no virus could be detected by microarrays. Expected 
endogenous retrovirus (ERV) sequences were detected in cell lines tested and known 
species-specific viral sequences were detected in bovine serum and porcine trypsin. A 
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follow-up strategy was developed using PCR amplification, nucleotide sequencing, and 
bioinformatics to demonstrate that an RD114-like retrovirus sequence that was detected by 
PLEX-ID in canine cell lines (Madin-Darby canine kidney (MDCK) and Cf2Th canine 
thymus) was due to defective, endogenous gammaretrovirus-related sequences. 

Keywords: broad-range PCR amplification; electrospray ionization-mass spectrometry; 
PLEX-ID; virus microarray; RT-PCR; endogenous retroviruses 



1. Introduction 

Advanced nucleic acid-based technologies such as massively parallel or deep sequencing (MPS), 
broad-range PCR with electrospray ionization mass spectrometry (PCR/ESI-MS; PLEX-ID platform), 
and virus microarrays have been used for research investigations of a variety of clinical samples, 
environmental materials, and some biological products, thus demonstrating their capabilities for broad 
virus detection and novel virus discovery [1-5]. Additionally, viral sequences have been identified in 
various biological raw materials used for cell culture such as bovine serum [6] and porcine trypsin [7]. 
Detection of latent viruses [8,9] or novel viruses [10] has also been reported in some cell lines by using 
the new methods. The development of new vaccines for current and re-emerging diseases such as AIDS 
and pandemic influenza has necessitated the use of novel cell substrates of human, canine, avian, and 
insect origin. A major challenge regarding use of these cell lines for vaccine manufacture has been 
addressing regulatory safety concerns related to the potential presence of viruses that may not be 
detected by the routinely recommended virus detection assays for cell line safety such as latent DNA 
viruses and endogenous retroviruses (ERVs) as well as occult and unknown viruses [11]. While these 
technologies exhibit the potential for characterization of biological materials, their regulatory application 
requires rigorous and systematic evaluation including assay standardization and development of 
follow-up strategies for determining the biological relevance and significance of a positive result. 

Conventional PCR assays are highly sensitive but rely on specific primers, thus limiting detection to 
known virus sequences. Virus detection using cell-based assays is generally based upon virus replication 
and the ability to produce a visual effect such as cytopathicity or hemadsorption/hemagglutination [11]; 
therefore, latent viruses and viruses lacking such biological properties may not be detected [12]. 
Accordingly, chemical induction assays can be used to activate latent viruses to enhance their detection [13]; 
however, the induced viruses may be novel and not detected by the currently available assays. For this 
reason, it is important to evaluate new technologies such as PLEX-ID, virus arrays, and MPS for broad 
virus detection. 

The PLEX-ID platform amplifies nucleic acids using broad-range PCR primers followed by 
electrospray mass-spectrometry to detect a variety of microbial agents, such as bacteria, fungi, 
mycoplasma and viruses [14—16]. Conserved regions are used to design broad-range primers for 
detection of virus families. The weight of the amplicons is determined by mass spectrometry and a 
base count (BC) composition of A, G, C, and T is generated; hence, known or novel organisms can be 
detected and identified by comparing genomic signatures to a curated BC database [15,17-19]. 
In the case of unknown microbial agents, a genomic signature is generated and the target family is 
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identified [10,15]. Currently, the PLEX-ID Biopharma viral assays consists of four panels, which as a 
set, can detect approximately 35 viral families and potentially more than 1000 species belonging to the 
targeted viral families [20]. Although the number of samples analyzed per set is limited, the PLEX-ID 
platform can run and generate data for a large number of plates in one run. 

Virus microarrays consist of oligomers or probes designed based upon conserved and specific 
sequences across multiple regions of known viruses and/or bacteria. DNA or cDNA is fluorescently 
labeled and hybridized to the arrays. A scanner detects fluorescence if the sample DNA or cDNA binds 
a probe and the data is analyzed by customized software. The Lawrence Livermore microbial detection 
array version 2 (LLMDA v.2) can detect 2000 viruses as well as 900 bacteria [4], and the ViroChip v.5 
(University of California, San Francisco, CA, USA) can detect ~2000 viruses [2,21,22]. 

In this study, we have compared the PLEX-ID, virus microarrays, and reverse transcriptase 
(RT)-PCR assays for detection of known viruses and have used the PLEX-ID for evaluation of various 
vaccine-related cell lines. Furthermore, we have developed a follow-up strategy for identification of a 
positive result detected by the PLEX-ID to determine the relevance and significance of the nucleic acid 
detection signal for safety evaluation in biologies. 

2. Results and Discussion 

2.1. Limit of Virus Detection 

Well-characterized retrovirus stocks of xenotropic murine retrovirus-related virus (XMRV) and 
simian foamy virus serotype 1 (SFV-1; recently designated as SFVmcy-1 [23]), with known infectious 
titer and number of RT-containing particles, were used to create RNA dilution panels to determine the 
LOD of these viruses using PLEX-ID, RT-PCR assays, and virus microarrays. Sf9 cells were chosen to 
provide background nucleic acids for virus spiking studies since insect viruses were not detected by 
any of these assays (discussed below; Section 2.2.1). Although the LLMDA v.5 arrays could have 
detected insect errantiviruses, the generally used bioinformatics algorithm excluded detection of these 
sequences (discussed below; Section 2.2.2). 

2.1.1. RT-PCR Assays 

hiitially, the entire dilution series of XMRV (10^-10"^) and SFV-1 (10*^-10"^) were tested (5 ^L 
sample) in the absence of background nucleic acids by RT-PCR assays to determine sensitivity of virus 
detection. Based upon the results, a subset of virus dilutions (10-10"' for XMRV and 10-10"" for 
SFV-1) were tested three times by RT-PCR assays in a background of water and in 10^ or 10"* cell 
equivalents of Sf9 nucleic acids: the results of the LOD fi-om the three assays were similar (one is 
shown in Figure 1). Complementary DNA (cDNA) sjoithesized from the virus dilution panels was 
subjected to nested PCR amplification using XMRV gag primers or SFV-1 gag primers (Figure 1, a 
and b, respectively; +RT panels). An RT-minus PCR control (-RT panels) was negative indicating the 
absence of cellular DNA contamination in the RNA preparations. The LOD in the absence of Sf9 
nucleic acids for XMRV was 3.12 particles per \iL (Table 1) and for SFV-1 was 40.2 particles per \iL 
(Table 2); however, in both cases virus detection was 10-fold less sensitive in the presence of 10^ cell 
equivalents of Sf9 nucleic acids. It is noteworthy that the sensitivity of SFV-1 detection in a 
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background of lO"* cell equivalents of Sf9 nucleic acids was similar to virus detection in the absence of 
Sf9 nucleic acids. (Table 2; not done for XMRV). 

Figure 1. Determination of retrovirus LOD using RT-PCR assays. Total RNA was 
extracted from each virus dilution to create XMRV and SFV-1 RNA panels (10*'- 10"') in 
the absence and presence of 10^ or lO"* cell equivalents of Sf9 total nucleic acids as described 
in Materials and Methods. A subset of the RNA panels (10~^-10~^) was subjected to nested 
RT-PCR assays. An RT minus (-RT) control of the samples shows the absence of residual 
cellular DNA and the negative, no template control for the PCR shows the absence of 
contamination in the assay, (a) XMRV gag primers; (b) SFV-1 gag primers. The size of 
fragments in the 100 bp marker (M) is indicated in base pairs (bp). 



-Sf9 + Sf9 (10°) 




Table 1. Limit of detection using XMRV RNA panel. 





Sample 




PLEX-ID 


ViroChip 


RT-PCR ' 




RNA DUution " 


TCIDso/mL 


Particles/^L " 


(-)Sf9 


+10^ 


(-)Sf9 


+10^ 


(-)Sf9 


(-)Sf9 


+10^ 


10"^ 


10'-^ 


312 


+ 


+ 


+ 


nt" 


nt 


+ 


+ 


10-^ 


10°^ 


31.2 


+ 


+ 


+ 


nt 


+ 


+ 


+ 




10-0.5 


3.12 


+ 


+ 


+ 




+ 


+ 






lQ-1.5 


0.312 






+ 




+ 






10-' 


10-2.5 


0.0312 








nt 









" Results from nested PCR with XMRV gag primers; Viral RNA dilution series were spiked into RNase/DNase 
free water [(-)Sf9] or present (+) in 10^ or lO" cell equivalents of Sf9 total nucleic acids; " Calculated based upon 
RT activity determined by TSF-PERT assay; * Not tested. 
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Table 2. Limit of detection using SFV-1 RNA panel. 





Sample 




PLEX-ID 


LLMDA 




RT-PCR " 




RNA DUution " 


TCIDso/mL 


Particles/^L ' 


(-)Sf9 


(-)Sf9 


+10' 


(-)Sf9 


+10^ 


+10' 


10"' 


10'^ 


402 


nt" 


nt 


nt 


+ 


+ 


nt 


10-^ 


lO^-^ 


40.2 




+ 


nt 


+ 




+ 


10-^ 




4.02 




+ 










10"* 


jQ-0.5 


0.402 












nt 



^ Results from nested PCR with SFV-1 gag primers; Viral RNA dilution series were spiked into 
RNase/DNase free water [(-)Sf9] or present (+) in 10^ or 10'' cell equivalents of Sf9 total nucleic acids; 
Calculated based upon RT activity determined by TSF-PERT assay; Not tested. 



2.1.2. PLEX-ID and Virus Arrays 

Based upon the RT-PCR results, selected virus RNA samples were tested by PLEX-ID and virus 
arrays for comparative LOD analysis. Most samples were tested at least two times and a number of 
samples were independently tested in our laboratory using an on-site PLEX-ID machine and beta-test 
panels (data not shown), and at Athogen (Athogen, Ibis Biosciences, Carlsbad, CA, USA) using 
updated Biopharm Viral Assays [20]: similar results were obtained. The results obtained using the 
PLEX-ID and virus arrays to analyze the XMRV RNA panel and the SFV-I RNA panel are shown in 
Tables 1 and 2, respectively. The results of the RT-PCR assays shown in Figure 1 are also included in 
the tables for comparison. PLEX-ID detected XMRV in the RNA dilution of 10"^ both in the absence 
and presence of Sf9 total nucleic acids, which corresponded to 3.12 particles per \iL. SFV-1 was not 
detected due to the absence of virus family-specific primers in the PLEX-ID panels. 

At a quantile threshold of 0.99 [4], LLMDA could detect the 10"^ XMRV RNA dilution in water, 
which corresponded to 0.312 virus particles per [iL; however, no signal was seen in the background of 
10^ or 10'* cell equivalents of Sf9 total nucleic acids. To further evaluate the specificity of the XMRV 
signal detected with the I0~^ RNA dilution, filtered supernatant from uninfected LNCaP cells and the 
complete medium used for virus propagation were tested by LLMDA; negative results were obtained 
(data not shown). LLMDA detected SFV-1 in the 10"^ RNA dilution, which corresponded to an LOD 
of 4.02 virus particles per )j,L. 

Analysis of XMRV using the ViroChip indicated an LOD of 0.312 particles per |uL (RNA dilution 
10"^) in the absence of Sf9 nucleic acids. Only viral sequences with a threshold of at least five or more 
probes matching the viral genome with 94%-100% identity were reported. Samples in the presence of 
Sf9 total nucleic acids were not tested using the ViroChip. It should be noted that virus detection in the 
absence of Sf9 nucleic acids by LLMDA was a log more sensitive than PLEX-ID and RT-PCR; 
however, there was no virus detection using LLMDA in the background of 10^ or 10"^ cell equivalents 
of Sf9 total nucleic acids. PLEX-ID had the same LOD in the absence and presence of 10^ cell 
equivalents of Sf9 total nucleic acids and RT-PCR had a similar LOD in the absence and presence of 
10"* cell equivalents of Sf9 total nucleic acids. 
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2.2. Investigation of Cell Lines and Cell Culture Reagents 
2.2.1. PLEX-ID Analysis 

A variety of cell lines, used in research and relevant to biological products, were evaluated for virus 
detection by the PLEX-ID since this platform was sensitive even in the presence of background cellular 
nucleic acids. Total nucleic acids were extracted from cell pellets and analyzed using PLEX-ID Biopharma 
Viral Assay Kits (Athogen, Ibis Biosciences, Carlsbad, CA, USA). In addition, biological raw 
materials such as complete medium, FBS, trypsin, PBS, and ultra-Pure dH20 were also tested by 
PLEX-ID (data not shown). Bovine viral diarrhea virus 1 (BVDV-1) sequences were detected in 
complete medium and in FBS [6]. Porcine circovirus type 2 (PCV-2) sequences were detected in 
trypsin, which was expected since this virus is ubiquitous in swine [24]. PBS and ultra-pure dHaO 
were free of detectable viral sequences. 

The cell lines analyzed in this study and the PLEX-ID results are shown in Table 3. Several 
endogenous retroviruses belonging to alpharetrovirus, betaretrovirus, or gammaretrovirus genera were 
identified [25,26]. Avian leukosis virus (ALV) sequences were detected by PLEX-ID in all human cell 
lines except HeLa. Furthermore, in-depth mass spectrometry analysis of each well, revealed "unknown" 
BCs that were not present in Athogen' s curated database. Some of the peaks were found to be potential 
"unidentified" alpharetrovirus sequences, which were compared to Athogen's curated database. The 
analysis revealed that the BCs for the unidentified alpharetrovirus sequences were near-matches to 
ALV, human endogenous retrovirus K (HERV-K), or human mammary tumor virus (HMTV). This 
information revealed that the HeLa cell line, which appeared to be free of ALV sequences, did in fact 
harbor ALV-Iike sequences with a BC three mutations away from known ALV BCs. In addition, 
Rous-associated virus 2 (RAV-2)-Iike sequences were detected in Raji, A204, and HeLa cells. 
Similarly, unidentitied betaretrovirus sequences were detected in all of the human cell lines (Table 3) 
and the BCs were found to be closely related to known HERV-K BCs. This analysis revealed that 
A204 and HEK293 were not outliers, but simply harbored an endogenous betaretrovirus whose BC 
was represented in the curated database, whereas the other human cell lines harbored variants of an 
endogenous betaretrovirus not found in the curated database. In addition, simian retrovirus 1 
(SRV-l)-related sequences were detected in A204 and HEK293. Since alpharetroviruses and 
betaretro viruses were found in all of the human cell lines, a BLAST search was performed using their 
respective PLEX-ID forward and reverse primer sequences (data not shown). The analysis revealed 
multiple hits to human chromosomes, suggesting that the alpharetrovirus and betaretrovirus sequences 
detected were due to endogenous retroviral sequences known to be present in human DNA [26]. 

Simian cell lines were also examined using the PLEX-ID platform. Table 3 shows that SRV-1 and 
baboon endogenous virus (BaEV) sequences were detected in VERO. SRV-related sequences as well 
as BaEV-related sequences have previously been shown in VERO cells to be due to simian endogenous 
retroviruses (SERV) [9,27]. No unidentified BCs were found after in-depth mass spectrometry analysis. 
It should be noted that the SRV-1 -related sequences found in the human cell lines and monkey cell 
lines had closely related BCs, but were not identical. 
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Table 3, PLEX-ID analysis of cell lines. 


Cell Lines 


Alpharetrovirus-related 


Betaretrovirus-related Gammaretrovirus-related 


ALV 


RAV-2 Unidentified 


SRV-1 


Unidentified RD114 BaEV Undistinguishable 


Human 










A549 




+ 




+ 


Raji 


+ 


+ + 




+ 


MRC-5 


+ 


+ 




+ 


A204 


+ 


+ + 


+ 


+ 


HEK293 


+ 




+ 


+ 


293T/17 ^ 


+ 


+ 




+ 


HeLa 




+ + 




+ 


Simian 










VERO 






+ 


+ 


CV-1 






+ 


+ 


Canine 










MDCK 








+ + 


CI2T11 








+ + 



^ Simian virus 40 (SV40) is constitutively expressed in 293T/17 cells and was detected, but not listed in the table; 
+, detection of positive signal, or a blank space is left in the absence of a signal. 



PLEX-ID analysis of the canine MDCK and CGTh cell lines identified RDl 14-like gammaretrovirus 
sequences. Additionally, "undistinguishable" gammaretrovirus sequences were found in both cell lines 
whose BC matched several known gammaretro viruses (Table 3). Since the identification of RDl 14-like 
sequences in the canine cell lines was unexpected, a follow-up strategy was developed to understand 
the origin of this viral hit (Section 2.3). 

In addition to endogenous retroviruses, simian virus 40 (SV40) sequences were detected by 
PLEX-ID in 293T/17 cells, which constitutively expresses the large T antigen (footnote. Table 3) [28]. 
However, other viral sequences known to be present in the test cell lines were not detected; these 
included: adenovirus sequences in 293T/17 and HEK293 cells [29], Epstein-Barr virus (EBV) sequences 
in Raji cells [30] and human papillomavirus (HPV) sequences in HeLa cells [31]. Several insect cell 
lines (SL2, Sf9, and Hi Five), which contain insect ERVs or errantiviruses [32], were also examined 
but no viral sequences were detected (data not shown). Moreover, a novel rhabdovirus that was 
reported in insect Sf9 cells [33] was not detected by PLEX-ID even though broad-range primers 
designed to detect members of the Rhabdoviridae family were included in the PLEX-ID Biopharma 
Porcine origin viral assay used to screen the cell lines. 

2.2.2. Virus Arrays 

A subset of cell lines tested by PLEX-ID (Raji, MDCK, VERO, Sf9, and SL2) was also analyzed 
using virus arrays of which VERO and Sf9 were tested at least two times. At a quantile threshold 
of 0.99, LLMDA detected HERV-K in VERO and Raji cells but other sequences detected by PLEX-ID 
such as SERV and BaEV-related in VERO and alpharetrovirus sequences (ALV- and RAV-related) in 
Raji cells were not detected. Furthermore, LLMDA did not detect RDl 14 sequences in MDCK cells 
that were found by PLEX-ID. Conversely, EBV was detected in Raji cells by LLMDA, but not by 
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PLEX-ID. No viruses were detected in insect Sf9 and SL2 cell lines by PLEX-ID, however, the 
updated LLMDA v. 5 was able to detect the nine known errantiviruses in Sf9 cells [32] and muhiple 
ones in SL2 cells (data not shown) using a modified bioinformatics analysis and a speciahzed 
retroelement database designed by LLMDA [4]. However, a novel rhabdovirus that was found in the 
Sf9 cell line [33] was not detected by LLMDA. 

MDCK and VERO were also tested using the ViroChip (data not shown): BaEV- and Mason-Pfizer 
monkey virus (MPMV)-related sequences, which are related to SRV-1, were detected in VERO; 
RDl 14 was not detected in MDCK cells. 

2.3. Follow-Up Strategy for a Positive Hit by PLEX-ID 

The detection of a positive signal in cells using the nucleic acid-based technologies can be due to an 
exogenous virus infection or endogenous retroviral sequences that may be expressed as infectious or 
defective particles. Therefore, it is important to distinguish the origin and nature of the signal. 

2.3.1. PCR Amplification and Nucleotide Sequencing 

A follow-up strategy was developed to confirm and identify RDl 14 retroviral sequences detected 
by PLEX-ID in MDCK and Cf2Th cells. In an effort to determine the origin of the PLEX-ID RDl 14 
signal we used PCR primers that targeted the same highly conserved region of the gammaretrovirus 
pol gene as in the PLEX-ID analysis. Total nucleic acids extracted Irom MDCK cells were subjected to 
a nested PCR assay as described in materials and methods. Since the first round of PCR amplified a 
slightly longer fragment Irom the MDCK cells than the nested PCR, we used the 74 bp fragment 
instead of the 64 bp fragment for fiarther analysis (Figure 2a). The DNA fragment, designated as 
MDCK74, was extracted by gel purification and nucleotide sequences determined. Comparison with 
the RDl 14 virus genome indicated 97% sequence identity (shown in Figure 3b). 

Figure 2, Follow up analysis of the PLEX-ID RDl 14 signal, (a) A 74 bp RD114-like 
gammaretrovirus fragment was amplified from total nucleic acids extracted from MDCK 
cells, as described in materials and methods; (b) Blastn analysis was conducted using 
the 74 bp fragment nucleotide sequence as the query against the Canis lupus familiaris 
genome database (TAXID: 9615) and several BAC clones were identified. An alignment 
highlighting different areas of similarity between RDl 14, several chromosomal Canis 
lupus familiaris BAC clones (designated A-L; see Table 4 for details), and the MDCK 
PCR fragment are shown. 



a. b. 
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Figure 3. Molecular phylogenetic analysis and nucleotide sequence comparison of RDl 14-like 
CfERVs pol region 3. (a) A maximum likelihood tree based on the Kimura 2-parameter 
model was constructed using a conserved region of the pol region from BAC clones A-L 
(see Figure 2b) and distant homologs, HTLV-1 and ALV [34]. The bootstrap consensus 
tree inferred from 1000 replicates [35]. The percentage of replicate trees in which the 
associated taxa clustered together in the bootstrap test (1000 replicates) is shown next to 
the branches [35]. The analysis involved 15 nucleotide sequences. Codon positions included 
were 1st + 2nd + 3rd + Noncoding. All positions containing gaps and missing data were 
eliminated. There were a total of 134 positions in the final data set. Sequence MUSCLE 
alignment and evolutionary analyses were conducted in MEGA5 [36]; (b) Nucleotide 
sequence comparison of the MDCK74 sequence with the analogous region in RDl 14 virus 
and CfERVs is shown. Nucleotide differences are indicated in red. 



a 



88 



93 



27 



73 



IZ 



56 



C 



36 



IZ 



CffiRV-A 
CfERV-B 

CfERV-L 

CfERV-J 
CfERV-D 
CfERV-K 
CfERV-I 

CtlRV-C 
CfERV-E 
CfERV-F 
CfERV-H 
CfERV-G 

— RDl 14 pol 

— HTLVpol 

— ALV pol 



MDCK -• TGGGGATTGATTGGAAATTACATTGTGCATATAGGCCCCAGAGCTCAGGACAGGTAGAAAGAATGAATAGAACA 
RD114 - TGGGGATTAATTGGAAATTACACTGTGCATATAGGCCCCAGAGCTCAGGACAGGTAGAAAGAATGAATAGAACA 
C^ER^■-A, B . GATTGGAAATTACATTGTGCATATAGGCCCCAGAGCTCAGGACAGGTAGAGAGGATGAACAGAACA 
CJERV-C^ . GATTGGAAATTACATTGTGCATATAGGCCCCAGAGCTCAGGACAGOTAGAGAGGATGACTAGAACA 
CfERV-D. ^ GATTGOAAATTAGATTGTGCATATAGGCCCCAGAGCTCAGGACAGGTAGAGAGGATGAACAGAACA 
CIERV-E.TGGGGGCAGATTGGAAATTACAT-GTGCATATAGGCCCCAAAGCTCAGGACAGGTAGAGAGGATGAACAGAACA 
CIERV-F- - GATTGGAAATGACATACTGCATATAGGCCCCAGAGCTCAGGACAGGTAGAGAGGATGAACAGAACA 
CtERV-G- - GATTGGAAATGACATTGTGCATAGAGGCCCCAGAGCTCAGGACAGGTAGAGAGGATGAACAGAACA 
CffiRV-H. • GATTGGAAATGACATCGTGCACATAGGCCCCAGAGCTCAGGACAGGTAGAGAGGAAGAATACAACA 

CfERV-I- - GATTGGAAATTACATTGTGCATGTAGGCCCCAGAGCTCAGGACAGGTAGACAGGATGAACAGAACA 
CfERV-J . . GATTGGAAATTATATTGTGCATATACGCCCCAAAGCTCAGGACAGGTAGAGAGGATGAACAGAACA 

CfERV-K GATTGGAAATTAGATTGTGCATATAAGCCCCAGAGCTCAGGACAGGTAGAGAGGATGAACAGAACA 
CfERV-L. ^ GATTGGAAGTTACATTGTGCATATAGGCCCCAGAGCTCAGGACAGGTAGAGAGGATGAACAGAACA 



2.3.2. BLAST and RetroTector® Analysis 



To identify the origin of the RDl 14-like sequence, a blastn search was performed using the sequence 
of the MDCK-RD114 PCR fragment to interrogate the Canis lupus familiaris (TAXID: 9615) database. 
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Sequence hits with strong E-values ranging from 9e-23 to 2e-17 were found within several BAC 
clones (designated as A-L in Figure 2b and listed in Table 4). Since the PLEX-ID amplicon was 
identified as RD114 and the sequence shared strong sequence identity with several Canis lupus 
familiaris BAC clones, an alignment was done with RDl 14 complete retrovirus genome (Genbank ID 
AB705392.1) as the query and the identified Canis lupus familiaris BAC clones (A-L) as the subject 
entries (Figure 2b). Although PLEX-ID identified viral sequences belonging to RDl 14, the blastn 
alignment revealed three small regions (region 1, 2, and 3) with high alignment scores (80-200 and 
>200) to RDl 14 gag and pol (Figure 2b). Each of these BAC clones was located in different regions of 
chromosome 1 or in different chromosomes altogether. Since gammaretrovirus-like elements have 
been shown to exist in abundance throughout the Canis lupus familiaris genome [37], it is likely that 
the viral sequences detected by PLEX-ID belong to RDl 14-like ERVs. 

To determine the retroviral genome structure of the RD114-related ERVs within the Canis lupus 
familiaris genome, the identified BAC clones were analyzed using RetroTector® [38], a fi-ee-share 
software designed to detect and characterize entire or fragmented ERVs in chromosomal DNA. 
Proviral motifs were found in at least one location within each BAC clone; the range spanned by the 
provirus is listed in column 5 of Table 4. In some cases, two proviral sequence chains (seq) were 
detected within a single BAC clone. For example, clones xx-73E6, and xx-219I19 each harbored two 
potential proviral regions; one provirus sequence chain shared similarity with RDl 14 (e.g., CfERV A 
and D) and the other sequence chain did not (e.g., CfERV M and N). It is likely proviruses that did not 
have homology to RDl 14 were related to other retroviruses [37]. 

Different RDl 14-like ERVs were found in several chromosomes including chromosome 1, 3, 8, 13, 
25, and X. Although RetroTector detected many provirus motifs [38], many were small, incomplete, 
or segmented as indicated by the incomplete list of motifs shown in column 7 of Table 4. Some 
proviruses like those found within clone xx-73E6 and xx-35E2 appear to possess many motifs, but 
only a fraction of the motif is present. Thus, these ERVs are likely inactive as a result of genetic 
duplications, insertions, deletions, and/or frameshifts that have occurred during host duplication over 
millions of years. These variations are clearly depicted in the alignment shown in Figure 2b, where 
different degrees of RDl 14 similarity are found in certain regions within each BAC clone. 

Due to the variable degrees of similarity found between the proviral sequences, a phylogenetic 
analysis was conducted using nucleotide sequences from region 3 (Figure 2b), since this was common 
to all 12 canine ERVs including the PLEX-ID PCR fragment amplified from MDCK cells. The 
nucleotide span of region 3 within each BAC clone is defined in column 6 of Table 4. In addition, 
analogous pol regions of human T-lymphotropic virus 1 (HTLV-1) and ALV were used as distant 
delatretrovirus and alpharetrovirus homologs, respectively. We aligned the nucleotide sequences using 
both ClustalW and MUSCLE alignment options in MEGA 5.1 (Molecular Evolutionary Genetics 
Analysis program) [36] to ensure the results were robust and not dependent on the type of alignment 
used. The phylogeny was constructed using a maximum-likelihood analysis based on the Kimura 
2-parameter model (Figure 3) [34]. Bootsfrapping (1000 replicates) was used to support the confidence 
at each node [35]. The results suggest that these ERVs diverged from a gammarefrovirus, perhaps 
RDl 14, but have undergone significant genetic change as demonsfrated by the divergence within this 
set of sequences (Figure 3 a). 
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Table 4. Proviral regions within the Canis familiaris BAC clones detected by RetroTector. 



Provirus 11) 


BAC Clone XX-(Cham) 


Chr 


Accession 


RetroTector Provirus Location 




Region 3 ' 






Range 




Provirus Motiis Detected 


CfERV-A 


73E6 (seql) 


1 


ACl 84807. 10 


5394 


14,041 


9902 


10,264 


73 


5' LTR, PBS, MA, 
CA, RT, IN, TM 


CfERV-B 


309C2 (seql) 


1 


AC194581.5 


143,113 


150,196 


146,057 


146,419 


73 


5' LTR, PBS, MA, 
CA, RT, IN, TM 


CfERV-C 


412K16 (seql) 


1 


AC191 152.6 


47,384 


51,060 


47,352 


47,712 


72 


PBS, MA, CA, RT, IN 


CfERV-D 


219119 (seql) 


3 


AC186968.15 


71,780 


76,840 


73,083 


73,454 


74 


5' LTR, PBS, CA, 
NC, PR, IN, 3 'LTR 


CffiRV-E 


231B7 (seq3) 


8 


AC194586.17 


211,515 


222,372 


220,330 


220,712 


70 


5' LTR, PBS, CA, NC, PR, RT, 
IN, TM, 3' LTR 


CfERV-F 


408G5 (seql) 


13 


AC 188926.6 


167,789 


173,976 


170,817 


171,150 


68 


5' LTR, PBS, CA, 
MA, NC, PR, TM 


CfERV-G 


239F12 (seql) 


25 


ACl 86972. 16 


89,015 


94,074 


93,745 


94,113 


73 


RT,IN 


CfERV-H 


146G22 (seq2) 


X 


AC186068.17 


147,356 


151,076 


149,410 


149,580 


76 


CA, NC, PR, IN 


CfERV-I 


35E2 (seql) 


X 


AC187326.il 


106,297 


114,944 


109,230 


109,569 


72 


5' LTR, PBS, MA, CA, NC,PR, 
RT, IN, TM, 3' LTR 


CfERV-J 


306G8 (seql) 


X 


AC189722.4 


76,817 


80,919 


79,993 


80,358 


72 


PBS, MA, CA, PR, IN 


CfERV-K 


494D7 (seql) 


14 


AC 189730.3 


3268 


8924 


5573 


5939 


74 


5' LTR, PBS, CA, NC, IN, TM 
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Table 4. Cont. 



Provirus ID 


BAC Clone XX- (Chain) 


Chr* 


Accession 


RetroTector Provirus Location *■ 


Region 3 ' 




Provirus Motifs Detected ' 


Range 




CfERV-L 


224p2and282jl4(seql) 


12 


AJ630363.1 


113,754 


118,869 


115,518 115,882 


73 


PR, IN, TM 


CfERV-M « 


73E6 (seq3) 


1 




207,305 


216,077 


n/a n/a 




5' LTR, PBS, MA, OA, NO, IN, 
TM, 3' LTR 


CfERV-N ^ 


219119 (seq3) 


3 




219,296 


226,087 


n/a n/a 




5' LTR, PBS, CA, NC, PR, RT, 
IN, 3' LTR 


CfERV-0 ^ 


35E2 (seq2) 


X 




106,282 


115,165 


n/a n/a 




5' LTR, PBS, MA, CA, NC, PR, 
RT, IN, TM, 3' LTR 



^Nucleotide location of provirus on the BAC clone; Chromosomal location of BAC clone; ° Region of RDl 14 homology in Figure 2b; ''Nucleotide location of Region 3 on the BAC clone; 
° Percent sequence identity to RDl 14; *^ Long terminal repeat [LTR], primer binding site [PBS], matrix [MA], capsid [CA], nucleocapsid [NC], protease [PR], reverse transcriptase [RT], 
integrase [IN], transmembrane [TM]; ^ Proviral region had no similarity to RDl 14; Proviral region was very similar to 35E2 seql. 
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3. Experimental 

3.1. Cell Lines 

Cell lines were obtained from American Type Culture Collection (ATCC, Manassas, VA, USA) 
except as indicated. Trypsin (Gibco, Grand Island, NY, USA, cat. No. 25300-054) was used for 
passaging all adherent cell lines, except the insect cell lines, which were dislodged by scraping. Each 
cell line was passaged at least three times before the cells were harvested at the passage number 
indicated below and washed twice with Ix PBS (Quality Biologicals, Gaithersburg, MD, USA, cat. No. 
1 14-058-101). After washing, 2x10^ cells were pelleted at 1200 rpm at 4 °C (AllegraTM 6KR centrifuge, 
Beckman Coulter, Danvers, MA, USA) and frozen at -80 °C for fijrther nucleic acid extraction. 

MDCK (Madin-Darby canine kidney, passage 64; ATCC, cat. No. CCL-34), HEK-293, (human 
embryonic kidney, passage 42; ATCC, cat. No. CRL-1573), CV-1, (African green monkey kidney, 
passage 39; ATCC, cat. No. CCL-70), HeLa (human cervical carcinoma, passage 100-112; ATCC 
cat. No. CCL-2), MRC-5 (human diploid fetal lung, passage 23; ATCC, cat. No. CCL-171), and 
VERO (African green monkey kidney, passage 127; ATCC, cat. No. CCL-81) were grovm in Eagle's 
minimal essential medium (EMEM, Mediatech, Manassas, VA, USA, cat. No. 15-010-CV) 
supplemented with 10% fetal bovine serum (FBS, heat-inactivated at 56 °C for 30 min; Hy clone, 
Logan, UT, USA, cat. No. SH30071.07) or 5% FBS in case of VERO, 100 U penicillin per mL and 100 
\ig streptomycin per mL (Quality Biological, cat. No. 120-095-721), 2 mM L-glutamine (Quality 
Biological, cat. No. 118-084-721), Ix nonessential amino acids (MEM-NEAA, Quality Biological, 
cat. No. 116-078-721), and 1 mM sodium pyruvate (Quality Biological, cat. No. 116-079-060). 
293T/17 (human fetal kidney, passage 21; ATCC, cat. No. CRL-11268) and A549 (human lung 
carcinoma, passage 84; ATCC, cat. No. CCL-185) were grown in Dulbecco's Modified Eagle Medium 
(DMEM; Gibco, Grand Island, NY, USA, cat. No. 11995-065) supplemented with 10% FBS, 100 U of 
penicillin per mL and 100 \ig of streptomycin per mL, and 2 mM L-glutamine. Cf2Th (neonate canine 
thymus, passage 62; ATCC, cat. No. CRL-1430) were grown in DMEM, 20% FBS, 100 U of peniciUin 
per ml and 100 |a.g of streptomycin per ml, 2 mM L-glutamine, and Ix MEM-NEAA. 

Raji (human Ijmiphoblastoid Ijmiphoma, FDA passage 5; ATCC, cat. No. CCL-86) was grown in 
RPMI 1640 (Quality Biological, cat. No. 1 12-024-101), 10% FBS, 100 U penicillin per mL and 100 ^g 
streptomycin per mL, 2 mM L-glutamine, and Ix MEM-NEAA. A204 (human rhabdomyosarcoma, 
passage 84; ATCC, cat. No. HTB-62) was grown in RPMI 1640, 10% FBS, 100 U penicillin per mL 
and 100 \ig streptomycin per mL, 2 mM L-glutamine, and 1 x MEM-NEAA. 

Sf9 (Spodoptera frugiperda ovary, passage 25; ATCC, cat. No. CRL-I71I) and High Five 
(Trichopulsia ni ovary, FDA passage 5; Invitrogen, Grand Island, NY, USA, cat. No. P/N5 1-4005) 
were growTi in Grace's Insect Medium (Invitrogen, cat. No. 11605-094), supplemented with 10% FBS 
(FBS, heat-inactivated at 56 °C for 30 min; certifred for insect cells Hyclone, cat. No. SH30070.03), 
100 U penicillin per mL and 100 )j,g streptomycin per mL, and 2 mM L-glutamine. Schneider's 
Drosophilia line 2 (SL2) {Drosophila melanogaster embryo, passage 21; D.Mel.(2); ATCC, cat. No. 
CRL-1963) was grown in Schneider's Drosophila Medium (Invitrogen, cat. No. 21720-024), 
supplemented with 10% FBS (certified for insect cells), 100 U penicillin per mL and 100 \ig streptomycin 
per mL, and 2 mM L-glutamine. 
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3.2. Preparation of XMRV and SFV-1 RNA Panels with and without Background Sf9 Total Nucleic 
Acids and RT-PCR Assays 

XMRV and SFV-1 virus stocks were prepared using LNCaP andMw^ dunni cells, respectively. The 
infectious titer for XMRV was 10^'^ TCID50 per mL [39] and for SFV-1 was 10^^ TCID50 per mL. For 
both viruses, 10-fold serial dilutions (range 1 0*^-1 0~^) were made from the original virus stock and the 
number of particles per (xL was determined based upon RT activity using a modified STF-PERT assay 
(designated as two-step fluorescent-PERT; TSF-PERT). The number of particles in the original stock 
was determined as the average of the calculated particles at each dilution based upon a standard curve 
using HIV-1 RT enzyme (Worthington Biochemical Corporation Lakewood, NJ, USA, cat. No. LS05006; 
lot No. X1H2839), which was determined to have 262 pU of RT activity per particle [40]. The 
STF-PERT one-step assay [40] was modified into a two-step assay that no longer requires the use of 
AmpliWax® PCR Gem 50, a product that has been discontinued. In addition, the TSF-PERT assay was 
designed as a partially automated process compatible with the Eppendorf epMotion 5070 robot 
(Eppendorf, Hauppauge, NY, USA, cat. No. 960000111). The RT master mix, standards, and samples 
were manually added to the 96-well plate as previously described [40] and the RT reaction was carried 
out using an Eppendorf MasterCycler ProS (Eppendorf, cat. No. 950030020). Once complete, the RT 
plate was placed on a 96-well cold Thermoblock (Eppendorf, cat. No. 960002083) and the PCR master 
mix was placed on a TMX 24 x SafeLock Thermorack (Eppendorf, cat. No. 960002070). The 
Eppendorf epMotion 5070 robot dispensed 25 |a,L of the PCR master mix into each well. Finally, the 
plate was sealed and processed for Taqman qPCR. The results indicated that XMRV had 3.12 x 10^ 
particles per |4,L and SFV-1 had 4.02 x 10^ particles per |iL. 

Total viral RNA was extracted from each dilution using the QIAamp viral RNA mini kit (Qiagen, 
Valencia, CA, USA, cat. No. 52904) in combination with the RNase-free DNase I set (Qiagen, 
cat. No. 79254) according to the manufacturer's instructions. Briefiy, 200 \iL of each dilution was 
processed as specified by the QIAamp viral RNA mini kit with an added on-column DNase I treatment. 
The RNA was eluted in 200 |a,L of DNase/RNase-free water. Concentration and purity were determined by 
using UV absorbance. 

Viral RNA panels were made in the presence and absence of background Sf9 cell nucleic acids. 
Total cell nucleic acid was extracted from Sf9 cell pellets using the QIAamp® MinElute® Virus Spin 
Kit (Qiagen, cat. No. 57704). Briefly, cell peUets (2x10^ cells) were resuspended in 200 \xL buffer 
and processed as specified in the QIAamp® MinElute® Virus Spin Kit followed by elution in 50 \iL of 
DNase/RNase-free water. RNA panels (10 -10 ) were initially made in a background of 10 cell 
equivalents of Sf9 total nucleic acids or in water to create 10-fold and 100-fold dilutions for testing in 
a background of 10^ and 10"* cell equivalents of Sf9 total nucleic acid. Selected samples were tested in 
different assays. 

cDNA was sjoithesized using iScript cDNA sjoithesis kit (Bio-Rad, Hercules, CA, USA, cat. No. 
170-8890) according to the manufacturer's instructions. An XMRV gag fragment (500 bp) was 
amplified by a nested PCR assay using viral cDNA (2 |a,L) with an annealing temperature of 63 °C for 
the outer primers, as previously described [41]. An SFV-1 gag fragment (548 bp) fragment was also 
amplified by a nested PCR assay. The first amplification was done with primers SFVgagFl (5'-AACCTAG 
GTGGAGAGCTGAAGG-3') and SFVgagRl (5'-ATGGAGAGGGTAAGAACCATGGG-3') to generate 
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a 947 bp fragment. This was followed by a second amplification with primers SFVgagF2 (5'-ATGAT 
GCACTTTGGCAGCCATTGG-3') and SFVgagR2 (5'-ACCTGCTGAATGTTGATTCTGTGC-3') to 
generate a 578 bp fragment. Viral cDNA (2 ^L) was used in a 50 \iL reaction containing 1.5 U Taq 
DNA Polymerase (Roche Applied Science), 0.5 mM primers, 0.2 mM dNTP mix, and 1 x PGR buffer 
with 1.5 mM MgCli (Roche Applied Science, Indianapolis, IN, USA). The PGR conditions used were: 
94 °G for 3 min, (94 °G for 30 s, 55 °G for 1 min, 72 °G for 1 min) x 35 cycles, 72 °G for 10 min, and 
a 4 °G hold for infinity. In each case a control sample without reverse transcriptase (RT) was included. 

DNA fragments were visualized with ethidium bromide staining after running 10 |a,L of each 
PGR reaction on a 1.4% agarose gel. DNA sizes were determined using a 100 bp DNA ladder (New 
England Biolabs, Ipswich, MA, USA, cat. No. N3231L). 

3.3. PCR/ESI-MS (PLEX-ID) 

Gell pellets (2 x 10^ cells) were processed in our laboratory or were sent to Athogen for processing [20]. 
In addition, total nucleic acids were extracted in our laboratory from PBS, ultra-pure dH20, complete 
medium (DMEM supplemented with 10% FBS (Hyclone, Logan, UT, USA, cat. No. SH30071.07, 
lot. No. ATE32066), 100 U of penicillin per ml and 100 \ig of streptomycin per mL, L-glutamine and 
Ix MEM-NEAA), and trypsin (Gibco, Grand Island, NY, USA, cat. No. 25300-054, lot No 917992). 
Briefly, cell pellets were resuspended in 200 \iL AVE buffer or 200 \iL of liquid sample were 
processed using the QIAamp® MinElute® Virus Spin Kit (Qiagen, cat. No. 57704) as described by the 
manufacturer. Total nucleic acids were eluted in 200 \iL of DNAse/RNAse-free water. All samples 
were analyzed using PLEX-ID Biopharma Viral Assay Kits (Ibis Biosciences, Abbott, Carlsbad, CA, 
USA): Bovine origin virus, Porcine origin virus, CHO cell virus, and Other virus, either in our laboratory 
using an on-site PLEX-ID or sent to Athogen for testing. In all cases, a one-step 50 \iL RT-PCR reaction 
containing 5 \iL of total nucleic acids extract was performed as previously described [42]. All PCRs 
were conducted in a 96-well plate format using an Eppendorf Mastercycler under conditions 
previously described [14,42]. Samples were processed by an automated carousel contained with the 
PLEX-ID machine, where PGR samples were desalted and injected into the ESI-TOF mass 
specfrometer. Once the signal was processed, genomic signatures were generated by Ibis software and 
compared against Athogen' s curated database. 

3.4. LLMDA and Virohip 

RNA, DNA, and total nucleic acid samples were sent to Lawrence Livermore National Laboratory 
(LLNL, Livermore, CA, USA) and University of California, San Francisco (UCSF, San Francisco, 
GA, USA) to be analyzed by LLMDA [4] and ViroChip v.5 [2,21,22], respectively. Briefly, each RNA 
sample (10 |a,L for the LLMDA and 11 |iL for the Virochip) was reverse-transcribed to cDNA using 
random primers and PGR-amplified prior to labeling with Gy3 fluorescent dye and hybridization to the 
arrays [2,4]. 

An updated LLMDA v.5 that was designed to detect around 5500 microbial species, which were 
sequenced through December 201 1, was used for our studies. This study employed the 12-plex 135 K 
format of this array, which is restricted to pathogens associated with vertebrate infection, and includes 
1856 viral, 1398 bacterial, 123 archaean, 48 fungal, and 94 protozoan species. In the LLMDA v.5 [4], 
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in order to expand the abilities to detect these viruses from cell lines probes were also designed to 
detect additional mammalian endogenous retroviruses and some insect viruses. These included: 6 
eukaryotic sequences (Drosophila simulans clone F pop-variant Chicharo (Portugal) retrotransposon 
tirant envelope protein (env) gene partial cds (Genbank ID JN786 103.1), Drosophila melanogaster 
copia-like element 17.6 (Genbank ID X01472.1), Drosophila melanogaster Idefix retroelement: gag 
pol and env genes partial (Genbank ID, AJ009736.1), Drosophila melanogaster transposable element 
297 (Genbank ID, X03431.1), Drosophila melanogaster clone 4.1 gypsy retrotransposon MDG4 complete 
sequence (Genbank ID, DQ887 187.1), Homo sapiens gypsy retrotransposon integrase 1 (GINl) mRNA 
(Genbank ID, NM_01 7676.2); 9 partial sequences of Spodoptera frugiperda insect viruses [33], 
1 baculovirus (Autographa califomica nuclear polyhedrosis virus (AcMNPV) mutant FP-D with 
incorporated Trichoplusia ni retrotransposon TED (Tn368) and three open reading frames (Genbank 
ID M32662.1); and 3 endogenous retroviruses {Odocoileus hemionus endogenous virus Cervid endogenous 
retrovirus CrERVg complete genome (Genbank ID JN592050.1), Simian retrovirus 1 isolate 
SRV Vero Assembled gag pseudogene partial sequence; pol protein (pol) gene partial cds; and env 
pseudogene complete sequence (Genbank ID HM 143 845.1), and Python molurus endogenous 
retrovirus Gag-Pro-Pol protein and Env genes complete cds (Genbank ID AF500296.1). 

The ViroChip v. 5 used in this study was an 8-plex 60K format and was updated to include 
60,000 probes representing complete and partial sequences of all viral genomes in GenBank as of 
December 2010 [21,22]. 

3.5. PCR Assay for RDl 14-Like Virus and Nucleotide Sequencing Analysis 

A positive hit for RDl 14 by PLEX-ID was verified using a nested PCR assay and nucleotide 
sequencing. PCR primers were developed based upon the PLEX-ID broad-range forward and reverse 
primer set 3417 sequences (kindly provided by Rangarajan Sampath, Ibis Biosciences, Abbott, 
Carlsbad, CA): outer primers were 3417F (5'-TGGGGATTGATTGGAAATTAC-3') and 3417R 
(5'-TGTTCTATTCATTCTTTCTAC-3'); an inner primer set was designed: 3417F2 (5'-TATTCATT 
CTTTCTACCTGTCGTG-3') and 3417R2 (5'-ATTGATTGGAAATTACATTGTGC-3'). Total nucleic 
acids (5 ^L) from MDCK cells were amplified in a PCR reaction of total volume 50 \iL containing 1.5 U 
Taq DNA Polymerase (Roche Applied Science, cat. No. 11647687001), 0.5 mM primers, 0.2 mM 
dNTP mix, and Ix PCR buffer with 1.5 mM MgCh (Roche Applied Science, cat No 11647687001). 
The PCR conditions used were: 95 °C for 5 min (95 °C for 30 s, 55 °C for 1 min, 72 °C for 1 min) x 
35 cycles, 72 °C for 10 min, and a 4 °C hold for infinity. A 74 bp fragment was amplified after the first 
round of PCR using the outer primers and a 64 bp fragment was amplified by a second amplification 
with the inner primers. 

DNA fragments were visualized by ethidium bromide staining after running 20 i^L of the PCR 
reaction on a 4% NuSieve GTG gel (Lonza, Rockland, ME, USA, cat. No. 50081). DNA sizes were 
determined using 0X174 DNAJBsuR I (Hae III) marker (Fermentas, Inc., Glen Bumie, MD, USA, 
cat. No. SM0253). DNA fragments were purified from the gel fragment by using Zymoclean™ Gel 
DNA Recovery Kit (Zymo Research Corp., Orange, CA, USA, cat. No. D4001) according to the 
manufacturer ' s instructions . 
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The purified PCR fragments were cloned into pGEM-T Easy Vector (Promega, Fitchburg, WI, 
USA, cat. No. A1380) as per the manufacturer's instructions and transformed into JM109 high 
efficiency competent cells. Several colonies were picked and were grown in S.O.C. medium 
(Invitrogen, Carlsbad, CA, USA, cat. No. 15544-034) at 37 °C overnight. Plasmid preps were made 
using QIAprep® Spin Miniprep kit (Qiagen, cat. No. 27106) and sequenced with the T7 primer using 
Big Dye v3.1 (Applied Biosystems, Foster City, CA, USA, cat. No. 4337455). Nucleotide sequences 
were generated on an ABI 3130x/ Genetic Analyzer and sequence analysis was done using Vector NTI 
(Invitrogen). The identity of the PCR fragments amplified from the first and second PCR 
amplifications was confirmed by nucleotide sequencing; since both had similar overlapping sequences 
the larger 74 bp sequence was used for subsequent analysis. 

Bioinformatic and phylogentic analysis were done using BLAST (National Center for Biotechnology 
Information, National Library of Medicine, NIH, Bethesda, MD, USA), CLC Genomics workbench 
(CLC Bio., Cambridge, MA, USA), RetroTector® (Uppsala Universitet, Uppsala, Sweden), and 
MEGA 5.1 (Molecular Evolutionary Genetics Analysis, Tempe, AZ, USA). 

4. Conclusions 

The results showed that the LOD of PLEX-ID for detection of XMRV was equivalent to RT-PCR 

in the absence of background nucleic acids, and greater by one log in the presence of 10^ cell 
equivalents of Sf9 total nucleic acids (Table 1). However, SFV-1 could not be detected by the PLEX-ID 
due to the absence of relevant primers. LLMDA was a log more sensitive for detection of both XMRV 
and SFV-1 as compared to PLEX-ID and RT-PCR, but LLMDA could not detect these viruses in a 
background of 10"* cell equivalents of Sf9 total nucleic acids. Selected samples of the XMRV (RNA 
dilutions 10"^- 10"^) and SFV-1 RNA panels (RNA dilutions 10"^- 10"^) were also tested with the 
ViroChip. In the absence of Sf9 total nucleic acids, the XMRV 10"^ RNA dilution sample was detected 
by the ViroChip, whereas, the SFV-1 RNA dilution 10"^ was not detected. In addition to the LOD 
studies, we used the new virus detection methods to analyze some vaccine -related cell substrates for 
the potential presence of novel viruses. Total nucleic acids were used for the detection of both DNA 
and RNA viruses. Analysis of cell lines by PLEX-ID and virus microarrays indicated the presence of 
various refrovirus-related sequences. However, the origin of these sequences could be attributed to the 
presence of cellular DNA containing endogenous retroviral sequences or viral transcripts that may be 
expressed constitutively from some endogenous refroviral DNAs (proviruses) [43]. However, this 
situation is applicable to all nucleic acid-based assays that can detect retroviruses. To assess the origin 
and nature of the refroviral sequences a follow up sfrategy using PCR, sequencing, and bioinformatics 
was developed. The analysis revealed that the unexpected RDl 14 sequences detected by PLEX-ID in 
canine cell lines were associated with defective endogenous gammarefro viruses. Similarly, blast 
analysis and amplicon base count comparisons also revealed that endogenous retrovirus sequences 
were responsible for positive hits within the alpharefrovirus and betarefrovirus families detected in 
human cell lines. 

Our analysis further showed that different viruses were detected by using different technologies. 
These results emphasize the need to use a combination strategy for virus detection in evaluating the 
safety of biologicals or screening patient samples. It was noted that some expected virus sequences that 
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were known to be present as fragments in some cells lines were not detected by any of the methods: 
e.g., adenovirus in HEK293 cells and 293T/17 cells, and HPV in HeLa cells. This may be due to the 
absence of primers for the viral specific regions in the assays. These results also indicate the need for 
further updates of the PLEX-ID panels and virus arrays to enhance detection of known and novel viruses. 

Acknowledgments 

We thank Rangarajan Sampath and Cicely A Washington (Ibis Biosciences, Abbott, Carlsbad, CA, 
USA), for PLEX-ID analysis and training; Crystal J. Jaing and Kevin S. McLoughlin (LLNL, 
Livermore, CA, USA) for LLMDA analysis; Shea Gardner (LLNL) for the probe design of the updated 
LLMDV V.5; and Charles Chiu and Erik Samayoa (UCSF, San Francisco, CA, USA) for the ViroChip 
analysis. We also thank Kostantin Chumakov and Andrew Daj^on for reading the paper. The work at 
LLNL was done through lAA # 1032444. The content of this publication does not necessarily reflect 
the views or policies of the Department of Health and Human Services, nor does mention of trade 
names, commercial products, or organizations imply endorsement by the U.S. Government. The 
findings and conclusions in this article have not been formally disseminated by the Food and Drug 
Administration and should not be construed to represent any Agency determination or policy. This 
project was supported by funds fi-om the Medical Countermeasures Initiative and Pandemic Influenza. 

Author Contributions 

Lanjai P. Taliaferro conducted the PLEX-ID and RT-PCR experiments and data analysis; Dhanya 
K. Williams performed the initial PLEX-ID performance analysis; Teresa A. Galvin and Lanyn 
Taliaferro performed the RetroTector analysis; Hailun Ma and Syed Shaheduzzaman provided critical 
reagents and protocols for viral RNA preparations; Dustin R. Glasner established the two-step STF- 
PERT assay and performed the PERT experiments; Arifa S. Khan designed the study and provided 
scientific oversight; Arifa S. Khan and Lanjoi P. Taliaferro prepared the manuscript. 

Conflicts of Interest 

The authors declare no conflict of interest. 
References 

1. Blyn, L.B.; Hall, T.A.; Libby, B.; Ranken, R.; Sampath, R.; Rudnick, K.; Moradi, E.; Desai, A.; 
Metzgar, D.; Russell, K.L.; et al. Rapid detection and molecular sero typing of adenovirus by use 
of PCR followed by electrospray ionization mass spectrometry. J. Clin. Microbiol. 2008, 46, 
644^651. 

2. Chen, E.C.; Miller, S.A.; DeRisi, J.L.; Chiu, C.Y. Using a pan-viral microarray assay (Virochip) 
to screen clinical samples for viral pathogens. J. Vis. Exp. 2011, doi: 10.3791/2536. 

3. Ecker, D.J.; Sampath, R.; BXja, L.B.; Eshoo, M.W.; Ivy, C; Ecker, J.A.; Libby, B.; Samant, V.; 
Sannes-Lowery, K.A.; Melton, R.E.; et al. Rapid identification and strain-typing of respiratory 
pathogens for epidemic surveillance. Proc. Natl. Acad. Sci. USA 2005, 102, 8012-8017. 



Viruses 2014, 6 



1894 



4. Gardner, S.N.; Jaing, C.J.; McLoughlin, K.S.; Slezak, T.R. A microbial detection array (MDA) 
for viral and bacterial detection. BMC Genomics 2010, 11, e668. 

5. Victoria, J.G.; Wang, C; Jones, M.S.; Jaing, C; McLoughlin, K.; Gardner, S.; Delwart, E.L. Viral 
nucleic acids in live-attenuated vaccines: Detection of minority variants and an adventitious virus. 
J. Virol. 2010, 84, 6033-6040. 

6. Onions, D.; Kolman, J. Massively parallel sequencing, a new method for detecting adventitious 
agents. Biologicals 2010, 38, 377-380. 

7. European Medicines Agency. Guideline on the use of porcine trypsin used in the manufacture of 
human biological medicinal products. (21 Feburary 2013, draft) Available online: ht^://www.ema. 
europa.eu/docs/en_GB/document_library/Scientific_guideline/2013/03AVC500139532.pdf (accessed 
on 25 September 2013). 

8. Ma, H.; Khan, A.S. Detection of latent retroviruses in vaccine-related cell substrates: Investigation 
of RT activity produced by chemical induction of vero cells. PDA J. Pharm. Sci. Technol. 2011, 

65, 685-689. 

9. Onions, D.; Cote, C; Love, B.; Toms, B.; Koduri, S.; Armstrong, A.; Chang, A.; Kolman, J. 
Ensuring the safety of vaccine cell substrates by massively parallel sequencing of the transcriptome. 
Faccme 2011, 29, 7117-7121. 

10. Sampath, R.; Blyn, L.B.; Ecker, D.J. Rapid molecular assays for microbial contaminant 
monitoring in the bioprocess industry. PDA J. Pharm. Sci. Technol. 2010, 64, 458-464. 

1 1 . FDA. Characterization and qualification of cell substrates and other biological materials used in 
the production of viral vaccines for infectious disease indications February 2010. Available online: 
http://www.fda.gov/downloads/biologicsbloodvaccines/gmdancecomplianceregulatoryinformation/ 
guidances/vaccines/ucm20243 9.pdf (accessed on 25 September 2013). 

12. Khan, A.S. Testing considerations for novel cell substrates: A regulatory perspective. PDA J. 
Pharm. Sci. Technol. 2010, 64, 426^31. 

13. Khan, A.S.; Ma, W.; Ma, Y.; Kumar, A.; Williams, D.K.; MuUer, J.; Ma, H.; Galvin, T.A. 
Proposed algorithm to investigate latent and occult viruses in vaccine cell substrates by chemical 
induction. Biologicals 2009, 37, 196-201. 

14. Eshoo, M.W.; Whitehouse, C.A.; ZoU, S.T.; Massire, C; Pennella, T.T.; Blyn, L.B.; Sampath, R.; 
Hall, T.A.; Ecker, J. A.; Desai, A.; et al. Direct broad-range detection of alphaviruses in mosquito 
extracts. Virology 2007, 368, 286-295. 

15. Hofstadler, S.A.; Sampath, R.; Blyn, L.B.; Eshoo, M.W.; Hall, T.A.; Jiang, Y.; Drader, J.J.; 
Hannis, J.C.; Sannes-Lowery, K.A.; Cummins, L.L.; et al. TIGER: The universal biosensor. Int. J. 
Mass Spectrom. 2005, 242, 23-41. 

16. Ecker, D.J.; Sampath, R.; Willett, P.; Samant, V.; Massire, C; Hall, T.A.; Hari, K.; McNeil, J.A.; 
Buchen-Osmond, C; Budowle, B. The Microbial Rosetta Stone database: A common structure for 
microbial biosecurity threat agents. J. Forensic. Sci. 2005, 50, 1380-1385. 

17. Ecker, D.J.; Sampath, R.; Massire, C; Blyn, L.B.; Hall, T.A.; Eshoo, M.W.; Hofstadler, S.A. Ibis 
T5000: A universal biosensor approach for microbiology. Nat. Rev. Microbiol. 2008, 6, 553-558; 

18. Sampath, R.; MulhoUand, N.; Blyn, L.B.; Massire, C; Whitehouse, C.A.; Waybright, N.; Harter, C; 
Bogan, J.; Miranda, M.S.; Smith, D.; et al. Comprehensive biothreat cluster identification by 
PCR/electrospray-ionization mass spectrometry. PLoS One 2012, 7, e36528. 



Viruses 2014, 6 



1895 



19. Sampath, R.; Hofstadler, S.A.; Blyn, L.B.; Eshoo, M.W.; Hall, T.A.; Massire, C; Levene, H.M.; 
Hannis, J.C.; Harrell, P.M.; Neuman, B.; et al. Rapid identification of emerging pathogens: 
Coronaviras. Emerg. Infect. Dis. 2005, 11, 373-379. 

20. Athogen, Ibis Biosciences, Carlsbad, CA, USA. Available online: http://www.athogen.coin/ 
consulting-services/biopharmaceutical-tests.html (accessed on 20 September 2013). 

21. Greninger, A.L.; Chen, E.C.; Sittler, T.; Scheinerman, A.; Roubinian, N.; Yu, G.; Kim, E.; 
Pillai, D.R.; Guyard, C; MazzuUi, T.; et al. A metagenomic analysis of pandemic influenza A 
(2009 HlNl) infection in patients from North America. PLoS One 2010, 5, el3381 

22. Lee, D.; das Gupta, J.; Gaughan, C; Steffen, I.; Tang, N.; Luk, K.C.; Qiu, X.; Urisman, A.; 
Fischer, N.; Molinaro, R.; et al. In-depth investigation of archival and prospectively collected 
samples reveals no evidence for XMRV infection in prostate cancer. PLoS One 2012, 7, e44954. 

23. Galvin, T.A.; Ahmed, I.A.; Shahabuddin, M.; Bryan, T.; Khan, A.S. Identification of recombination in 
the envelope gene of simian foamy virus sertype 2 isolated from Macaca cyclopis. J. Virol. 2013, 
87, 8792-8797. 

24. Rose, N.; Opriessnig, T.; Grasland, B.; Jestin, A. Epidemiology and transmission of porcine 
circovirus type 2 (PCV2). Virus Res. 2012, 164, 78-79. 

25. Van Regenmortel, M.H.V.; Fauquet, CM.; Bishop, D.H.L.; Carstens, E.B.; Estes, M.K.; 
Lemon, S.M.; Maniloff, J.; Mayo, M.A.; McGeoch, D.J.; Pringle, C.R.; et al. Virus Taxonomy, 
Classification and Nomenclature of Viruses: Seventh Report of the International Committee on 
Taxonomy of Viruses; Academic Press: San Diego, CA, USA, 2000. 

26. Gifford, R.; Tristem, M. The evolution, distribution and diversity of endogenous retroviruses. 
Virus Genes 2003, 26, 291-315. 

27. Ma, H.; Ma, Y.; Ma, W.; Williams, D.K.; Galvin, T.A.; Khan, A.S. Chemical induction of 
endogenous retrovirus particles from the vero cell line of African green monkeys. J. Virol. 2011, 
85, 6579-6588. 

28. DuBridge, R.B.; Tang, P.; Hsia, H.C.; Leong, P.M.; Miller, J.H.; Calos, M.P. Analysis of 
mutation in human cells by using an Epstein-Barr virus shuttle system. Mol. Cell. Biol. 1987, 7, 
379-387. 

29. Graham, F.L.; Smiley, J.; Russell, W.C.; Nairn, R. Characteristics of a human cell line transformed by 
DNA from human adenovirus type 5. J. Gen. Virol. 1977, 36, 59-74. 

30. Lindahl, T.; Adams, A.; Bjursell, G.; Bornkamm, G.W.; Kaschka-Dierich, C; Jehn, U. Covalently 
closed circular duplex DNA of Epstein-Barr virus in a human lymphoid cell line. J. Mol. Biol. 
1976,702,511-530. 

31. Meissner, J.D. Nucleotide sequences and further characterization of human papillomavirus DNA 
present in the CaSki, SiHa and HeLa cervical carcinoma cell lines. J. Gen. Virol. 1999, 80, 
1725-1733. 

32. Menzel, T.; Rohrmann, G.F. Diversity of errantivirus (retrovirus) sequences in two cell lines used 
for baculovirus expression, Spodoptera frugiperda and Trichoplusia ni. Virus Genes 2008, 36, 
583-586. 

33. Ma, H.; Galvin, T.A.; Glasner, D.R.; Shaheduzzaman, S.; Khan, A.S. Identification of a novel 
rhabdovirus in Spodoptera frugiperda cell lines. J. Virol. 2014, doi:10.1 128/JVI.00780-14. 



Viruses 2014, 6 



1896 



34. Kimura, M. A simple method for estimating evolutionary rates of base substitutions through 
comparative studies of nucleotide sequences. J. Mol. Evol. 1980, 16, \ \ 1-120. 

35. Felsenstein, J. Confidence limits on phylogenies: An approach using the bootstrap. Evolution 
1985, 39, 783-791. 

36. Tamura, K.; Peterson, D.; Peterson, N.; Stecher, G.; Nei, M.; Kumar, S. MEGA5: Molecular 
evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum 
parsimony methods. Mol. Biol. Evol. 2011, 28, 2731-2739. 

37. Barrio, A.M.; Ekerljung, M.; Jem, P.; Benachenhou, F.; Sperber, G.O.; Bongcam-Rudloff, E.; 
Blomberg, J.; Andersson, G. The first sequenced carnivore genome shows complex host-endogenous 
retrovirus relationships. PLoS One 2011, 6, el9832. 

38. Sperber, G.; Lovgren, A.; Eriksson, N.E.; Benachenhou, F.; Blomberg, J. RetroTector online, 
a rational tool for analysis of retroviral elements in small and medium size vertebrate genomic 
sequences. BMC Bioinf. 2009, 10, S4. 

39. Williams, D.K.; Galvin, T.A.; Gao, Y.; O'Neill, C; Glasner, D.; Khan, A.S. No evidence of 
xenotropic murine leukemia virus-related virus transmission by blood transfusion from infected 
rhesus macaques. J. Virol. 2013, 87, 2278-2286. 

40. Ma, Y.K.; Khan, A.S. Evaluation of different RT enzyme standards for quantitation of retroviruses 
using the single-tube fluorescent product-enhanced reverse transcriptase assay. J. Virol. Methods 
2009, 157, 133-140. 

41. Williams, D.K; Galvin, T.A.; Ma, H.; Khan, A.S. Investigation of xenotropic murine leukemia 
virus-related virus (XMRV) in human and other cell lines. Biologicals 2011, 39, 378-383. 

42. Sampath, R.; Russell, K.L.; Massire, C; Eshoo, M.W.; Harpin, V.; Blyn, L.B.; Melton, R.; Ivy, C; 
Pennella, T.; Li, F.; et al. Global surveillance of emerging Influenza virus genotypes by mass 
spectrometry. PLoS One 2007, 2, e489. 

43. Coffin, J.M. Endogenous retroviruses. In RNA Tumor Viruses, 2nd ed.; Weiss, R., Teich, N., 
Varmus, H., Coffin, J., Eds.; Cold Spring Harbor Laboratory: Cold Spring Harbor, NY, USA, 
1984; pp. 1109-1203. 

© 2014 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article 
distributed under the terms and conditions of the Creative Commons Attribution license 
(http://creativecommons.Org/licenses/by/3.0/). 



